Disentangled Graph Recurrent Network for Document Ranking
نویسندگان
چکیده
Abstract BERT-based ranking models are emerging for its superior natural language understanding ability. All word relations and representations in the concatenation of query document modeled self-attention matrix as latent knowledge. However, some knowledge has none or negative effect on relevance prediction between document. We model observable unobservable confounding factors a causal graph perform do-query to predict label given an intervention over this graph. For observed factors, we block back door path by adaptive masking method through transformer layer refine disentangled refinement layer. unobserved resolve do-operation from front decomposing into related unrelated parts decomposition Pairwise loss is mainly used ad hoc task, triangle distance introduced both layers more discriminative representations, mutual information constraints put Experimental results public benchmark datasets TREC Robust04 WebTrack2009-12 show that DGRe outperforms state-of-the-art baselines than 2% especially short queries.
منابع مشابه
Hierarchical Recurrent Neural Network for Document Modeling
This paper proposes a novel hierarchical recurrent neural network language model (HRNNLM) for document modeling. After establishing a RNN to capture the coherence between sentences in a document, HRNNLM integrates it as the sentence history information into the word level RNN to predict the word sequence with cross-sentence contextual information. A two-step training approach is designed, in wh...
متن کاملAn Ensemble Click Model for Web Document Ranking
Annually, web search engine providers spend more and more money on documents ranking in search engines result pages (SERP). Click models provide advantageous information for ranking documents in SERPs through modeling interactions among users and search engines. Here, three modules are employed to create a hybrid click model; the first module is a PGM-based click model, the second module in a d...
متن کاملSentence Ranking for Document Indexing
This article discusses a new document indexing scheme for information retrieval. For a structured (e.g., scientific) document, Pasi et al. proposed varying weights to different sections according to their importance in the document. This concept is extended here to unstructured documents. Each sentence in a document is initially assigned weights (significance in the document) with the help of a...
متن کاملDocument Modeling with Gated Recurrent Neural Network for Sentiment Classification
Document level sentiment classification remains a challenge: encoding the intrinsic relations between sentences in the semantic meaning of a document. To address this, we introduce a neural network model to learn vector-based document representation in a unified, bottom-up fashion. The model first learns sentence representation with convolutional neural network or long short-term memory. Afterw...
متن کاملLatent Document Re-Ranking
The problem of re-ranking initial retrieval results exploring the intrinsic structure of documents is widely researched in information retrieval (IR) and has attracted a considerable amount of time and study. However, one of the drawbacks is that those algorithms treat queries and documents separately. Furthermore, most of the approaches are predominantly built upon graph-based methods, which m...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Data Science and Engineering
سال: 2022
ISSN: ['2364-1541', '2364-1185']
DOI: https://doi.org/10.1007/s41019-022-00179-3